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Abstract- Software reliability is one of the important factors of 
software quality. Before software delivered in to market it is 
thoroughly checked and errors are removed. Every software 
industry wants to develop software that should be error free. 
Software reliability growth models are helping the software 
industries to develop software which is error free and reliable. 
In this paper an analysis is done based on incorporating the 
logistic-exponential testing-effort in to NHPP Software 
reliability growth model and also observed its release policy. 
Experiments are performed on the real datasets. Parameters 
are calculated and observed that our model is best fitted for 
the datasets. 

Keywords- Software Reliability, Software Testing, Testing 
Effort, Non-homogeneous Poisson Process (NHPP), Software 
Cost. 

ACRONYMS 

NHPP : Non Homogeneous Poisson Process 

SRGM : Software Reliability Growth Model 

MVF : Mean Value Function 

MLE : Maximum Likelihood Estimation 

TEF : Testing Effort Function 

LOC : Lines of Code 

MSE : Mean Square fitting Error 

NOTATIONS 

m (t) : Expected mean number of faults detected 

in time (0,t] 
e (t) : Failure intensity for m(t) 
n (t) : Fault content function 
m d (t) : Cumulative number of faults detected upto t 
rn (t) : Cumulative number of faults isolated up to t. 
W (t) : Cumulative testing effort consumption at timet. 
W*(t) : W (t)-W (0) 
A : Expected number of initial faults 

r (t) : Failure detection rate function 
r : Constant fault detection rate function. 

t 1 : Constant fault detection rate in the Delayed S- 

shaped model with logistic-Exponential TEF 
r 2 : Constant fault isolated rate in the Delayed S- 

shaped model with logistic-Exponential TEF 



I. INTRODUCTION 

Software becomes crucial in daily life. Computers, 
commutation devices and electronics equipments every place 
we find software. The goal of every software industries is 
develop software which is error and fault free. Every industry 
is adopting a new testing technique to capture the errors 
during the testing phase. But even though some of the faults 
were undetected. These faults create the problems in future. 
Reliability is defined as the working condition of the software 
over certain time period of time in a given environmental 
conditions. Large numbers of papers are presented in this 
context. Testing effort is defined as effort needed to detect 
and correct the errors during the testing. Testing-effort can 
be calculated as person/ month, CPU hours and number of 
test cases and so on. Generally the software testing consumes 
a testing-effort during the testing phase [20 21]. SRGM 
proposed by several papers incorporated traditional effort 
curves like Exponential, Rayleigh, and Weibull. The TEF 
which gives the effort required in testing and CPU time the 
software for better error tracking. Many papers are published 
based on TEF in NHPP models [4, 5, 8, 11, 120, 12, 20, 21]. 
All of them describe the tracking phenomenon with test 
expenditure. 

This paper we used logistic-exponential testing-effort 
curve and incorporated in the SRGM. The result shows that 
the SRGM with logistic-exponential 

II. SOFTWARE TESTING EFFORT FUNCTIONS 

Several software testing-effort functions are defined in 
literature. w(t) is defined as the current testing effort and 
W(t) describes the cumulative testing effort. The following 
equation shows the relation between the w(t) and W(t) 

W(t) = J*w(t)dt (1) 

The following are some of them 

A. Exponential Testing effort function 

The cumulative testing effort consumed in the time (0,t] 
is [20] 

B. Rayleigh Testing effort curve: 

}Vit) = A : < (1 - ex^-ij;:]) (2) 



©2011 ACEEE 

DOI: 01.IJNS.02.0.254 



38 



^cACEEE 



ACEEE Int. J. on Network Security , Vol. 02, No. 02, Apr 2011 



The cumulative testing effort consumed in the time (0,t] 
is [12,20] 



Wit) = N x (1 - exp [- (*) t 2 ] 



(3) 



The Rayleigh curve increases to the peak and descends 
gradually with decelerating rate. 

C. Logistic-exponential testing-effort: 

It has a great flexibility in accommodating all the forms 
of the hazard rate function, can be used in a variety of 
problems for modeling software failure data. 

The logistic-exponential cumulative TEF over time 
period (0,t] can be expressed as [27] 

Wit) = «X ( -^:-*>\ , t>0 (4) 
"1-1 1? ■ l 

And its current testing effort is 



w{() - 



ccx£xA.xe x \e — 1 J 



t>0 (5) 

C( is the total expenditure . k positive shape parameter and 
A is a positive scale parameter 

III. Software Reliability Growth Models 

A. Software reliability growth model with logistic- 
exponential TEF 

The following assumptions are made for software 
reliability growth modeling [1, 8, 11, 20, 21, 22] 

(i) The fault removal process follows the Non- 
Homogeneous Poisson process (NHPP) 

(ii) The software system is subjected to failure at 
random time caused by faults remaining in the 
system. 

(iii) The mean time number of faults detected in the time 
interval (t, t+At) by the current test effort is 
proportional for the mean number of remaining 
faults in the system. 

(iv) The proportionality is constant over the time. 

(v) Consumption curve of testing effort is modeled by 
a logistic-exponential TEF. 

(vi) Each time a failure occurs, the fault that caused it 
is immediately removed and no new faults are 
introduced. 

(vii) We can describe the mathematical expression of a 

testing-effort based on following 
dmit] 



tit 



w(0 



= r x {a -m(t) ) 



(6) 



m{t)=ax{\-e ! -^ ww - wm) ) 
Substituting W.{X) into Eq.(7) 5 we get 



m(t)=ax 



-rx«x 



(l + (e U '- 



(7) 



(8) 



,)')[ 

This is an NHPP model with mean value function with 
the Logistic -exponential testing- effort expenditure. 
Now failure intensity is given by 



ctm(t) 
dt 



-- a x r x w{t) x e 



r x W{t) 



(?) 



li^-lf -Xt-lti^-lf 



Mi) 



k-\ 



The expected number of errors detected eventually is 



(10) 



(ii) 



B. Yamada Delayed S-shaped model with logistic- 
exponential testing-effort function 

The delayed 'S' shaped model originally proposed by 
Yamada [24] and it is different from NHPP by considering 
that software testing is not only for error detection but error 
isolation. And the cumulative errors detected follow the S- 
shaped curve. This behavior is indeed initial phase testers 
are familiar with type of errors and residual faults become 
more difficult to uncover [1, 6, 15, 16]. From the above steps 
described section 3.1, we will get a relationship between 
m(t) and w(t). For extended Yamada S-shaped software 
reliability model. The extended S-shaped model [24] is 
modeled by 



and 



dm (t) 
j 

ttt 

dm{t) 
dt 



WiWAWAWvWiW VvWvVvWvVvVvWv 

We assume r 2 #iby solving 2 and 3 boundry conditions 

wvw- 

m [t) = « x (l- 



..(»)] 



(12) 



-r * W * (t) 
l 



and 



-r v P#> i n 



k ir- (r) | j 



[ rl-r2 (14) 

At this stage we assume s then using fi L' 

Hospitals rule the Delayed S-shaped model with TEF is 
given by 

«(f)=*(()= ( x(i-(]+, x r*(()) xt [ ' ,,r,( ' 11 ) (l5) 

The failure intensity function for Delayed S-shaped 
model with TEF is given by 

X(t) =ax r 2 x w(l) x W+(t) * o [ ~ r * W * U]] (1 q 

IV. EVALUATION CRITERIA 

A. The goodness of fit technique 

Here we used MSE [5, 11, 17, 23 ] which gives real 
measure of the difference between actual and predicted 
values. The MSE defined as 



\ m ( t i ) 



A smaller MSE indicate a smaller fitting error and better 
performance. 
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B. Coefficient of multiple determinations (R2) 

Which measures the percentage of total variation about 
mean accounted for the fitted model and tells us how well a 
curve fits the data. It is frequently employed to compare 
model and access which model provies the best fit to the 
data. The best model is that which proves higher R 2 . that is 
closer to 1 . 

C The predictive Validity Criterion 

The capability of the model to predict failure behavior 
from present & past failure behavior is called predictive 
validity. This approach, which was proposed by [26], can be 
represented by computing RE for a data set 

* (IE) 
In order to check the performance of the logistic- 
exponential software reliability growth model and make a 
comparison criteria for our evaluations [14]. 

D. SSE criteria: 



(19) 

Where is total number of failures ob sewed at a time tj 
according to the actual data and m(tj) is the estimated 
cumulative number of failures at a time fori=L2 : : n. 



SSE can be calculated as :[17] 



PE J = Actual [ observed ) i - Predicted (estimated) t 



-i 



PE. 



J '(trial it hi 



- t 



PE - Bias} 



MRE = 



l 



estimated ^actual 



(20) 

<21) 
<22) 
(23) 



V. MODEL PERFORMANCE ANALYSIS 
A. DS1: 

The first set of actual data is from the study by Ohba 
1984 [15].the system is PL/1 data base application software 
consisting of approximately l,317,0001ines of code .During 
nineteen weeks of experiments, 47.65 CPU hours were 
consumed and about 328 software errors are removed. 

Fitting the model to the actual data means by esti- 
mating the model parameter from actual failure data. Here 
we used the LSE (non-linear least square estimation) and 
MLE to estimate the parameters. Calculations are given in 
appendix A 




TIME (WEEKS) 

Fig 1 . Observed/estimated logistic-exponential and Rayleigh TEF for 
DSL 

All parameters of other distribution are estimated through 
LSE. The unknown parameters of Logistic-exponential TEF 
are a=72(CPU hours), e=0. 04847, and k=1.387. 
Correspondingly the estimated parameters of Rayleigh TEF 
N=49.32 and b=0.00684/week. Fig.l plots the comparison 
between observed failure data and the data estimated by 
Logistic-exponential TEF and Rayleigh TEF. The PE, Bias, 
Variation, MRE and RMS-PE for Logistic-exponential and 
Rayleigh are listed in Table I. From the TABLE I we can see 
that Logistic-exponential has lower PE, Bias, Variation, MRE 
and RMS-PE than Rayleigh TEF. We can say that our 
proposed model fits better than the other one. In the TABLE 
II we have listed estimated values of SRGM with different 
testing-efforts. We have also given the values of SSE, R 2 
and MSE. We observed that our proposed model has smallest 
MSE and SSE value when compared with other models. The 
95% confidence limits for the all models are given in the 
Table III. 

TABLE -1 

COMPARISON RESULT FOR DIFFERENT TEF APPLIED TO DS1 



TEF 


BIAS 


VARIATION 


MRE 


RMS-PE 


PRESENT 


0.2243 


1.297 


0.033 


1.27 


Rayleigh 


0.830337 


2.169314 


0.052676 


2.004112 



B. DS2: 

The dataset used here presented by wood [2] from 
a subset of products for four separate software releases at 
Tandem Computer Company. Wood Reported that the 
specific products & releases are not identified and the test 
data has been suitably transformed in order to avoid 
Confidentiality issue. Here we use release 1 for illustrations. 
Over the course of 20 weeks, 10000 CPU that SRGM with 
logistic-exponential TEF have less MSE than other models. 
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TAJ3LE-II 

ESTIMATED PARAMETER VALUES AND MODEL COMPARISION FOR DS1 



Models 


a 


r 


SSE 


R- 


MSE 


SRGM -with Logis-tic -exponential TEF 


57S.S 


0.01903 


21S3 


0.9SS9 


12836 


Delayed S shaped model with Loai&tic- 
exponential TEE" 


353.7 


0.0S863 


7546 


0.9615 


443.94 


SRGM with Rayleigh TEF 


459.1 


0.02734 


5100 


0.974 


299. 9S 


Delay ed S ^aped model with Rayleiah TEF 


333 J2 


0.1004 


15170 


0.9226 


S92J2 


G-O model 


760.5 


0.03227 


2656 


0.9365 


156J2 


Yamada Delayed S shaped model 


374.1 


0.1977 


3205 


0.9S37 


1SS.51 



Table m 

95 % C OXFIDEXCE LIMIT F OR DIFFERENT SELECTED MKJSEi^&fD S 1} 



Models 


a 


r 


Lower 


Upper 


Lower 


Upper 


SRGM with Lc^stic-exponenlial 
TEF 


441 5 


716 


00126S 


0.0253 S 


SRGM with Rayleigh TEF 


343.6 


5696 


01651 


0.03317 


Yamada Delayed S shaped Model 
with k££3te-expcaienlial TEF 


314 .5 


392.3 


0.0723 S 


01044 


Y"amada Delayed S shaped Model 
with Rayleigh TEF 


233.7 


377.7 


0.07507 


0.1253 


G-O mod el 


465 4 


1056 


001646 


004303 


yamada Delayed S shaped model 


343.7 


404.4 


0.1748 


0.2205 
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FigJ^, Observed/estimated Logistic-exponential and 
DS2. 

VI. OPTIMAL SOFTWARE RELEASE POLICY 

A. Software Release-Time Based on Reliability Criteria 

Generally software release problem associated with the 
reliability of a software system. Here in this first we discuss 
the optimal time based on reliability criterion. If we know 
software has reached its maximum reliability for a particular 
time. By that we can decide right time for the software to be 
delivered out. Goel and Okumoto [1] first dealed with the 
software release problem considering the software cost- 
benefit. The conditional reliability function after the last 
failure occurs at time t is obtained by 

Rft+At/t)=exp(-[m(t+ AtA)-m(t)T) 
= cxp( -m(At) x cxp( -r x W * (/) ) ) ^4) 

Taking the logarithm on both sides of the above 
equation and rearrange the above equation we obtain 
\nR=-m(At) x cxp( -r x W* (/)) ^5) 



inm(At) - In In j 



»'*(') =— - . 

r I (26) 

By solving the etj 26 we can reach the desired reliability 

level. For DS1 At=0.1 R=0.91 at T=42.1 weeks 



B. Optimal release time based on cost -reliability 
criterion 

This section deals with the release policy based on the 
cost-reliability criterion. Using the total software cost 
evaluated by cost criterion, the cost of testing-effort 
expenditures during software testing/development phase and 
the cost of fixing errors before and after release are: [9, 13, 
251 



+ q | w{x) dr 



C(r} = C m(T) + C 2 \m(T LC )-m(T)' 

(27) 

Where C 1 the cost of correcting an error during testing, 
C is the cost of correcting an error during the operation, C 



C is the cost of testing per unit testing effort 



as Minimize C(T) subjected to R(t+At/t)e" R Q for C 2 > C 1 C 3 
>0,At>0, 0<R <1. 

Differentiate the equation (30) with respect to T and 
setting it to zero, we obtain 



(28) 





' d 




5 






— C(D = C 
dr 1 


— »irj 
[dr J 


IC 

2 


— Mil )- 


— m{T) 

[* J 


+c ff(n 



a 

d 



-c. 



dr 



mm 



+ Cji*(T)=0 



(29) 



■^m(T)=\(T) 



c3 



UT) 
w(T) c2-cl 
art rW(t) = r{a-m(T)) 

When T=0 then m(0)=0 and 

When T->oo, then m{ ao } = a 

And 



(30) 



w(T) 



a r 



,_. = a x r x e 
*V) therefore 



MI) 
W (r) 



IS 



monotonically decreasing in T, 

To analyze the minimum value of C(T) Eq. (27) is used 



Mr) 



to define the two cases of T) 
atT=0. 1) if -M2L= ff Xr < 



< 



w(0) 

C3 



a -a 



, then 



w(r) ~~ C2 - CI foj 0<T<T LC it can be obtained at 
dC( T)/dT>0 for 0<T<T LC and the minimal value can found 
at C(T) can be found at T=0. 



MO) 

if— 7zr=a *r> 



a 



> —^7 = a * r x e 



w(0) C2-C7 w{T) 

there can be found a finite and unique real number 

C3 



i 



■I. r.' i rn 



2 

> c 

1 J 

expenditure and T LC is the software life-cycle length. 

From reliability criteria, we can obtain the required 
testing time needed to reach the reliability objective R Q . Our 
aim is to determine the optimal software release time that 
minimizes the total software cost to achieve the desired 
software reliability. Therefore, the optimal software release 
policy for the proposed software reliability can be formulated 
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(31) 



because dC(T)/dT<0 for 0<T<T and dC(T)/dT>0 for T> T 
5 the minimum of C(T) is at T=T for T < T 

we can easily get the required testing time needed to reach 
the reliability objective R Q . here our goal is to minimize the 
total software cost under desired software reliability and then 
the optimal software release time is obtained. That is can 
minimize the C(T) subjected to R(t+At/t)e" R Q where 0< R Q 
<1 [9,25] 

T* =optimal software release time or total testing time 
=max{T Q T }. Where T Q =finite and unique solution T 
satisfying Eq.(3 1) T 1 =finite and unique T satisfying R(t+At/ 

0=R. 
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By combining the above analysis and combining the cost 
and reliability requirements we have the following theorem. 

Theorem 1: Assume C2<C1<0, C3<0, At>0, andO<R Q 
<1. Let T*be the optimal software release time 



A. if 



MO) a 



Mo) a - a 



and 



HT) -rxa. C3 



w(T) " * ' " C2-CI = 
(r^r^forjf^j <j? <i 



max f 



r Q forO </?<,< , Q 



At 



c - iv(0) C2 - CI 



• w ' ? ItJ " 1 



OforO < 



.. MO) ^ cj th _, 



T. for J? 



*o<*(t] 



A' 



OfarO<^<i| — 



From the dataset one estimated values of SRGM with 
Logistic-exponential TEF a=72(CPU hours), e=0.04847 / 
week, k=1.387, a=578.8 and r=0.01903 when At=0.1 R n 



=0.85 and we let C=2, C2=50, C 3 



=150 and T LC =100 the 



estimated time 1^=37.1 weeks and release time from eq 30 
T Q =39.5 weeks. Now optimal Release Time max (37.1, 39.5) 
is T*=39.5 weeks. Fig 10 shows the change in software cost 
during the time span. Now total cost of the software at optimal 
time 8354. 

From the dataset two estimated values of SRGM with 
Logistic-exponential TEF a=12600(CPU hours), e=0.06352 
/week, k=1.391, a=135.6 and r=0.0001432 when At=0.1 R 
=0.85 and we let C=l, C2=200, C 3 =2 and T LC =100 the 
estimated time T 1 =18. 1 weeks and release time from Eq 31 
T Q =8.05 weeks. Now optimal Release Time max (8.05, 18.1) 
is T*=18.1 weeks. Fig 11 shows the change in software cost 
during the time span. Now total cost of the software at optimal 
time 20,100. 




lOOOC 



Fig 9 Reliability and Total Cost curve (DS1) 

CONCLUSION 

In this paper, we proposed a SRGM incorporating the 
Logistic-exponential testing effort function that is completely 
different from the logistic type Curve. We Observed that most 
of software failure is time dependent. By incorporating 
testing-effort into SRGM we can make realistic assumptions 
about the software failure. The experimental results indicate 
that our proposed model fits fairly well. 
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